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Abstract 

These notes give a brief introduction to the mo- 
tivations, concepts, and properties of distributions, 
which generahze the notion of functions f{x) to al- 
low derivatives of discontinuities, "delta" functions, 
and other nice things. This generalization is in- 
creasingly important the more you work with linear 
PDEs, as we do in 18.303. For example. Green's func- 
tions are extremely cumbersome if one does not al- 
low delta functions. Moreover, solving PDEs with 
functions that are not classically differentiable is of 
great practical importance (e.g. a plucked string with 
a triangle shape is not twice differentiable, making 
the wave equation problematic with traditional func- 
tions). Any serious work with PDEs will eventually 
run into the concept of a "weak solution," which is 
essentially a version of the distribution concept. 



X = 0. That is, one would like the function S{x) = 
for all X ^ 0, but with / S{x)dx = 1 for any in- 
tegration region that includes x = 0; this concept 
is called a "Dirac delta function" or simply a "delta 
function." 6{x) is usually the simplest right-hand- 
side for which to solve differential equations, yielding 
a Green's function. It is also the simplest way to 
consider physical effects that are concentrated within 
very small volumes or times, for which you don't ac- 
tually want to worry about the microscopic details 
in this volume — for example, think of the concepts of 
a "point charge," a "point mass," a force plucking a 
string at "one point," a "kick" that "suddenly" imparts 
some momentum to an object, and so on. The prob- 
lem is that there is no classical function 6{x) having 
these properties. 

For example, one could imagine constructing this 
function as the limit: 



1 What's wrong with functions? 

The most familiar notion of a function f{x) is a map 

from real numbers M to real numbers M (or maybe 
complex numbers C); that is, for every x you have 
a value y = f{x). Of course, you may know that 
one can define functions from any set to any other 
set, but at first glance it seems that M ^ M func- 
tions (and multi-dimensional generalizations thereof) 
are the best-suited concept for describing most phys- 
ical quantities — for example, velocity as a function 
of time, or pressure or density as a function of posi- 
tion in a fluid, and so on. Unfortunately, such func- 
tions have some severe drawbacks that, eventually, 
lead them to be replaced in some contexts by an- 
other concept: distributions (also called generalized 
functions). What are these drawbacks? 

1.1 No delta functions 

For lots of applications, such as those involving PDEs 
and Green's functions, one would like to have a func- 
tion 6{x) whose integral is "concentrated" at the point 



S(x) = lim 



l^l<Aa;/2 
otherwise 



lim dAxix)? 



For any Ax > 0, the function dAx{x) at right has 
integral = 1 and is zero except near a; = 0. Un- 
fortunately, the Ax — > limit does not exist as an 
ordinary function: 6ax{x) approaches oo for x = 0, 
but of course oo is not a real number. 

Informally, one often sees "definitions" of S{x) that 
describe it as some mysterious object that is "not 
quite" a function, which = for a; 7^ but is unde- 
fined at X = 0, and which is "only really defined inside 
an integral" (where it "integrates" to 1).^ This may 
leave you with a queasy feeling that 5{x) is somehow 



^Historically, this unsatisfactory description is precisely 
how S{x) first appeared, and for many years there was a corre- 
sponding cloud of guilt and uncertainty surrounding any usage 
of it. Most famously, an informal S{x) notion was popularized 
by physicist Paul Dirac, who in his Principles of Quantum Me- 
chanics (1930) wrote: "Thus d{x) is not a quantity which can 
be generally used in mathematical analysis like an ordinary 
function, but its use must be confined to certain simple types 
of expression for which it is obvious that no inconsistency can 
arise." You know you are on shaky ground, in mathematics, 
when you are forced to appeal to the "obvious." 
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not real or rigorous (and therefore anything based on 
it may be suspect). For example, integration is an 
operation that is classically only defined for ordinary 
functions, so it may not even be clear (yet) what "J" 
means when we write "/ 6{x)dx". 

1.2 Not all functions cire differentiable 

Most physical laws can be written in the form of 

derivatives, but lots of functions are not differen- 
tiable. Discontinuous functions arise all of the time 
at the interface between two materials (e.g. think 
of the density at the interface between glass and air) . 
Of course, at a microscopic level you could argue that 
the quantum wavefunctions might be continuous, but 
one hardly wishes to resort to atoms and quantum 
mechanics every time a material has a boundary! 

The classic example of a discontinuous function is 
the Heaviside step function: 



1 x>0 
X < 



The derivative S'{x) is zero everywhere except at x = 
0, where the derivative does not exist — the slope at 
a; = is "infinity." Notice that H'(x) very much 
resembles 5{x), and S{x')dx' would certainly look 
something like S{x) if it existed (since the "integral" 
of S should be 1 for a; > 0) — ^this is not a coincidence, 
and it would be a shame not to exploit it! 

A function doesn't need to be discontinuous to lack 
a derivative. Consider the function |a;|: its derivative 
is +1 for .T > and — 1 for x < 0, but at x = 
the derivative doesn't exist. We say that \x\ is only 
"piecewise" differentiable. 

Note that S{x) is very useful for writing down 
all sorts of functions with discontinuities. |x| = 
xS{x) — xS{—x), for example, and the Sax{x) "box" 
function on the right-hand-side of the S{x) "defini- 
tion" above can be written 6ax{x) = [S{x + Ax/2) — 
S{x- Ax/2)]/Ax. 

1.3 Nagging worries about 

discrepancies at isolated points 

When we try to do linear algebra with fiiiictions, we 
continually find ourselves worrying about excluding 
odd caveats and counterexamples that have to do 
with finite discrepancies at isolated points. For ex- 
ample, a Fourier series of a square-integrable function 
converges everywhere. . . except at isolated points of 
discontinuity [like the point a; = for S{x), where 
a Fourier series would converge to 0.5]. As an- 
other example, {u,v) = f uv defines an inner prod- 
uct on functions and a norm = {u,u) > for 



u ^ 0. . . except for u{x) that are only nonzero for iso- 
lated points. And with functions like S{x) there are 
all sorts of apparently pointless questions about what 
value to assign exactly at the discontinuity. It may 
likewise seem odd to care about what value to assign 
the slope of \x\ at a; = 0; surely any value in [—1, 1] 
should do? 

1.4 Limits and derivatives 

cannot always be interchanged 

In numerical and analytical PDE methods, we are 

continually writing functions as limits. A Fourier se- 
ries is a limit as the number of terms goes to infinity. 
A finite-difference method solves the problem in the 
limit as Ax 0. We initially find the Green's func- 
tion as the limit of the response to a box-like function 
that is nonzero outside of a width Aa;, and then take 
the limit Ax 0. After all of these kinds of things, 
we eventually substitute our solution back into the 
PDE, and we assume that the limit of the solution is 
still a solution. In doing so, however, we usually end 
up interchanging the limits and the differentiation. 
For example, we usually differentiate a Fourier series 
by: 



dx 



a„ sin(n7ra;) = j~ sin(n7ra;) = n7ra„ cos(n7ra;), 

n=l n=l n=l 

but this is not always true if we interpret "=" in the 
usual sense of being true for every x. The general 
problem is that one cannot always interchange limits 
and derivatives, i.e. ^ lim =/= lim^. (Note that 

J2n=i is really a limit limjv^cx> J2n=i-) 
A simple example of this is the function 



—€ x < —e 

x —e<x<e, 

e X > € 



m = 



—€< X < e 
\x\ > e 



The limit as e — > of fe{x) is simply zero. But 
/^(O) = 1 for all e, so 







d_ 
dx 



lim /, 



7^1 



lim 

£^0 



d 

dx 



Notice, however, that this mismatch only occurs at 
one isolated point x = 0. The same thing happens 
for Fourier series: differentiation term-by-term works 
except at isolated points. Thus, this problem returns 
to the complaint in the previous section 1.3. 

1.5 Too much information 

A function f{x) gives us a value at every point x, but 
does this really correspond to a measurable quantity 
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in the physical universe? How would you measure the 
velocity at one instant in time, or the density at one 
point in a fluid? Of course, your measurement device 
could be very precise, and very small, and very fast, 
but in the end all you can ever measure are averages 
of f{x) over a small region of space and/or time. 

But an average is the same thing as an integral 
/ f{x) over the averaging region. More generally, in- 
stead of averaging f{x) uniformly in some region, we 
could average with some weights (j){x) (e.g. our de- 
vice could be more sensitive to some points than oth- 
ers). Thus, the only physical question we can ever 
ask about a function is the value of an integral 

/oo 
f{x)(f){x)dx 
-oo 

of f{x) against a test function 0(a;). 

But if all we can ever ask is such an integral, why 
are we worrying about isolated points? In fact, why 
do we even define f{x) to have values at points at 
all? In a physical application of mathematics, per- 
haps we should only define things that we can mea- 
sure! This insight fixes every one of the problems 
above, and leads to the concept of distributions. 

2 Distributions 

The old kind of function is a map from M ^ M: given 
an X. we get a value f{x). Following section 1.5, 
this is too much information; we can only ask for an 
"average" value given some weight function 4>{x). So, 
we make a new definition of "function" that provides 
this information, and only this information: 

• / is a rule that given any test function 4>{x) re- 
turns a number f{(l>}-^ 

This new definition of a "function" is called a dis- 
tribution or a generalized function. We are no 

longer allowed to ask the value at a point x. This 
will fix all of the problems with the old functions from 

above. However, we should be more precise about our 
definition. First, we have to specify what 4>{x) can 
be: 

• (f){x) is an ordinary function M — > M (not a dis- 
tribution) in some set V. We require 4>{x) to be 
infinitely differentiable. We also require 4>{x) to 
be nonzero only in some finite region (the "sup- 
port" of 0): is a smooth "bump" function.^ 

^Many authors use the notation (/, cp) instead of /{</>}, but 
we are already using {•, •) for inner products and I don't want 
to confuse matters. 

^Alternatively, one sometimes loosens this requirement to 

merely say that d>{j:) must quickly as a; — > dioo, and in 
particular that x^^<p(x) — > as — » itoo for all integers n > 0. 



[The generalization to functions (p{x) for x G ffi**, with 
the distributions corresponding to d-dimensional in- 
tegrals, is very straightforward, but we will stick to 
d = 1 for simplicity.] Second,^ we require that f{(f>} 
act like integration in that it must be linear: 

• f{a(l)i + I3(j>2} = af{(l)i} + I3f{(j)2} for any num- 
bers a, /? G M and any 4'i,(t>2 <= 'D- 

Thus, / is a linear map from 2? ^ M, and the set 
of all distributions for a given set of test functions 
V is soinetinies denoted V . There are two classes of 
distributions: regular and singular distributions. 

2.1 Regulair distributions 

from ordinciry functions f{x) 

The most obvious way to define a distribution /{0} is 
simply an ordinary integral of an ordinary function: 
given an ordinary function f{x), we can define the 
distribution: 

/oo 
f{x)^{x)dx. 
-oo 

This is called a regular distribution. 

Not all ordinary functions f{x) define regular dis- 
tributions: we must have / f(j) finite for all 4> ^ T). 
This reduces to requiring that ^^\f{x)\dx < oo for 
all intervals [a,b] (/ is "locally integrable"). 

2.2 Singular distributions 
& the delta function 

Although the integral of an ordinary function is one 
way to define a distribution, it is not the only way. 
For example, the following distribution is a perfectly 
good one: 

m = m- 

This rule is linear and continuous, there are no 
weird infinities, nor is there anything spooky or non- 
rigorous. Given a test function (j){x), the S{x) distri- 
bution is simply the rule that gives (f){0) from each (p. 
S{x), however, does not correspond to any ordinary 
function — it is not a regular distribution — so we call 
it a singular distribution. 

• Notation: when we write "/ S{x)<j){x)dx", we 
don't mean an ordinary integral, we really mean 



■^There is also a third requirement: /{</>} must be continu- 
ous, in that if you change (/>(a;) continuously the value of f{4>} 
must change continuously. In practice, you will never violate 
this condition unless you are trying to. 
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Furthcrinorc. when we say S{x — x'), we just mean 
the distribution 6{x — = ^(a;'). 

If we look at the "finite-Aa; delta" approximation 
^Ax{x) from section 1.1, that defines the regular dis- 
tribution: 



5ax{(I>} 



1 
Ax 



Ax/2 
-Ax/2 



{x)dx, 



which is just the average of 4'{x) in [— Aa;/2, Aa;/2]. 
Now, however, viewed as a distribution, the limit 
Ax ^ is perfectly well defined:® limAx^o ^Axi^P} = 
0(0) = S{(l)}, i.e. Sax ^ S. 

Of course, the S distribution is not the only singular 
distribution, but it is the most famous one (and the 
one from which many other singular distributions are 
built). We will see more examples later. 

2.3 Derivatives of distributions 

&c differentiating discontinuities 

How do we define the derivative /' of a distribution? 
Well, at the very least we want it to be the same as 
the ordinary derivative when / is a regular distribu- 
tion f{x) that happens to be differentiable in the or- 
dinary sense. In that case, /'{?!>} = / f'{x)^{x)dx = 
— J f{x)(j)' {x)dx = /{—(/)'}, where we have integrated 
by parts and used the fact that 0(a;) is zero outside a 
finite region to eliminate the boundary terms. This 
is such a nice result that we will use it to define the 
derivative of any distribution: 

• The distributional derivative /' of /{</>} is 
given by the distribution /'{<^} = /{— 0'}, where 
(l)'{x) is the ordinary derivative of ^(a;). 

(This is sometimes also called a weak derivative.) 
Since the test functions 4'{x) were required to be in- 
finitely differentiable, we have a remarkable conse- 
quence: every distribution is infinitely differentiable 
(in the distributional sense). 

For example, since (5{0} = ^^(0), it immediately 
follows that the derivative of a delta function is the 
distribution 6'{(j)} = S{-(j)'} = -0'(O). 

The most important consequence of this definition 
is that even discontinuous functions are differ- 
entiable as distributions, and their derivatives give 
delta functions for each discontinuity. Consider the 
regular distribution S defined by the step function 
Six): 



S{x)(j){x)dx 



f 

Jo 



(f){x)dx. 



*We have used the fact that (t>(x) is required to be continu- 
ous, from which one can show that nothing weird can happen 
with the average of (pi^) ^■s Ax — > 0. 



It innncdiatcly follows that the distributional deriva- 
tive of the step function is 



/•oo 

= S{-(f>'} = - / (t)'ix)da 

JQ 



But this is exactly the same as 5{(t)], so we immedi- 
ately conclude: 



S' = 5 . 



Since any function with jump discontinuities can be 

written in terms of S{x), we find that the derivative 
of any jump discontinuity gives a delta function mul- 
tiplied by the magnitude of the jump. For example, 
consider: 



he a; > 3 



-3). 



The distributional derivative works just like the ordi- 
nary derivative, except that S' = S, so^ 

f'{x) = 2x + (3a;^ - 2a;)S'(a; -3) + (3^ - 3^)5(a; - 3) 

.Of/ ON 1 2a; a; < 3 
= 186{x - 3) + <^ , 
I 3a; a; > 3 

where of course by f'{x) I mean the distribution 

/'{(/)}. It is common to be "casual" with notation in 
this way for distributions, treating them like ordinary 
functions, but you have to remember that you can't 
evaluate them at any point x, you can only evaluate 
them for test functions (/)(a;). 

2.4 Isolated points 

With ordinary functions, we had to make lots of 
caveats about isolated points. No more with distri- 
butions. The key point is that two different ordinary 
functions can define the same distribution. Consider, 
for example, the function 



1 x = 
otherwise 



This is not a delta function: it is finite at a; = 0, and 
is a perfectly acceptable function. It also defines a 
regular distribution: 



J{x)(l){x)dx = 0. 



^I'm being a bit glib here. How do we know that the product 
rule works the same? Below, we will rigorously define what it 

means to multiply a distribution by a smooth function like 
— , and the ordinary product rule will follow. 
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Think about it: no matter what is. the integral 
must give zero because it is only nonzero (by a fi- 
nite amount) at a single point, with zero area ("zero 
measure" for the pure-math folks). Thus, in the dis- 
tribution sense, we can say perfectly rigorously that 

/ = 

even though f{x) 7^ in the ordinary-function sense! 

In general, any two ordinary functions that only 
differ (by finite amounts — not delta functions!) at 
isolated points (a "set of measure zero") define the 
same regular distribution. We no longer have to 
make caveats about isolated points — finite values at 
isolated points make no difference to a distribution. 

For example, there are no more caveats about the 
Fourier series or Fourier transforms: they converge, 
period, for distributions.'' 

Also, there is no more quibbling about the value of 
things like S{x) right at the point of discontinuities. 
It doesn't matter, for distributions. Nor is there quib- 
bling about the derivative of things like right at 
the point of the slope discontinuity. It is an easy mat- 
ter to show that the distributional derivative of |a;| is 
simply 2S{x) — 1, i.e. it is the regular distribution 
corresponding to the function that is +1 for a; > 
and — 1 for .T < (with the value at a; = being 
rigorously irrelevant) . 

2.5 Interchanging 

limits and derivatives 

With a distribution. limits and (distributional) 
derivatives can always be interchanged. This is 
tremendously useful when talking about PDEs and 
convergence of approximations. In the distribution 
sense, the Fourier series can always be differentiated 
term-by-term, for example. 

This is easy to prove. Suppose that the distribu- 
tions /„ ^ / as n ^ 00. That is, for any (j){x), 
fn{<P} fW}- Since this is true for any (/>(a;), 
it must be true for —(j)'{x), and hence f = 
/«{-</>'} ^ /{-</-'} = as n ^ 00. Q.E.D. 

3 Rules for distributions 

For the most part, in 18.303, we will cheat a bit. 
We will treat things as ordinary functions when- 
ever we can, using the ordinary operations, and only 

'^Technically, we have to choose distributions with the right 
set of test functions. The right test functions for the general 
Fourier transform on the real line are those for which x"4'{x) —* 
as a; — + ±00 for any n > 0, i.e. 0(a;) vanishes faster than any 
polynomial. The resulting distributions are called tempered 
distributions, and are the domain of the Fourier transform. 



switch to interpreting them as distributions when we 
run into difficulty (e.g. derivatives of discontinuities, 
etcetera). Since the rules for distribution operations 
are all defined to be consistent with those for func- 
tions in the case of regular distributions, this doesn't 
usually cause any trouble. 

However, it is good to define some of the important 
operations on distributions precisely. All you have to 
do is to explain what the operation does to test func- 
tions, usually defined by analogy with J f{x)<p{x)dx 
for regular distributions. Here are a few of the most 
basic operations: 

• differentiation: /'{0} — f {—(}>} 

• addition: (A + /2){0} = fi{(b} + /2{c6} 

• multiplication by smooth functions (including 
constants) s{x): [s{x) ■ f]{(j)} = f{s{x)(l){x)} 

• product rule for multiplication by smooth func- 
tions six): [s- /]'{</>} = [s- f]{-<P'} = /{-s<A'} = 
f{s'^ - (s0)'} = f{s'cb} + f'{s<P} = [s' ■ fm + 

[s-rm^w-f+s-rm. 

• translation: [f{x - y)]{(t>{x)} = f{(t){x + y)} 

• scaling [/(aa;)]{^(a;)} = ^/{(pix/a)} 

If you are not sure where these rules come from, just 
try plugging them into a regular distribution /{^} = 
/ f{x)(j){x)dx, and you'll see that they work out in 
the ordinary way. 

4 Problems with distributions 

Unfortunately, distributions are not a free lunch; they 

come with their own headaches. There are two ma- 
jor difficulties, one of which is surmountable and the 
other is not: 

• Boundary conditions: since distributions do not 

have values at individual points, it is not so easy 
to impose boundary conditions on the solutions 
if they are viewed as distributions — what does it 
mean to set u(0) — 0? There are ways around 
this, but they are a bit cumbersome, especially 
in more than one dimension. 

• Multiplication: it is not generally meaningful 
to multiply distributions. The simplest exam- 
ple is the delta function: what would 6{x)^ be? 
Sax{x)'^ is okay, but its limit as Ax does 
not exist even as a distribution (the amplitude 
goes as 1 / Ax^ while the integral goes as Ax, so 
it diverges). 
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For linear PDEs, lack of multiplication is not such 
a big problem, but it does mean that we need to 
be careful about Hilbert spaces: if we think of the 
solutions u{x) as distributions, we have a problem 
because (u, u) may not be defined — the set of distri- 
butions does not form a Hilbert space. (Technically, 
we can make something called a rigged Hilbert space 
that includes distributions, but I don't want to go 
there.) 



5 The weak form of a PDE 

Suppose we have a linear PDE Au = f. We want 
to allow / to be a delta function etcetera, but we 
still want to talk about boundary conditions, Hilbert 
spaces, and so on for u. There is a relatively simple 
compromise for linear PDEs, called the weak form 
of the PDE or the weak solution. This concept 
can roughly be described as requiring Au = f only 
in the "weak" sense of distributions (i.e., integrated 
against test functions, taking distributional deriva- 
tives in the case of a u with discontinuities), but re- 
quiring u to be in a more restrictive class, e.g. regu- 
lar distributions corresponding to functions satisfying 
the boundary conditions in the ordinary sense but 
having some continuity and differentiability so that 
(w, u) and {u, Au) are finite (i.e., living in an appropri- 
ate Sobolev space). Even this definition gets rather 
technical very quickly, especially if you want to allow 
delta functions for / (in which case u can blow up at 
the point of the delta for dimensions > 1). Nailing 
down precisely what spaces of functions and opera- 
tors one is dealing with is where a lot of technical 
difficulty and obscurity arises in functional analysis. 
However, for practical engineering and science appli- 
cations, we can get away with being a little careless in 
precisely how we define the function spaces because 
the weird counterexamples are usually obviously un- 
physical. The key insight of distributions is that what 
we care about is weak equality, not pointwise equality, 
and correspondingly we only need weak derivatives 
(integration by parts, or technically "bilinear forms"). 



6 Green's functions 

Now that we have distributions, Green's functions are 
much easier to work with. Consider, for example, the 
Green's function G{x,x') oi A = — on [0, L] with 
Dirichlet (zero) boundary conditions. If Au = f is 
to be solved by u{x) = J G{x,x')f{x')dx', then we 
must have Au = J[AG{x,x')]f{x')dx' = f{x), which 



is true if 

AGix, x') = S{x - x') 

and the integrals are re-interpreted as evaluating a 
distribution. 

What does this equation mean? For any x ^ x' 
[or for any (p^x) with (j){x') = 0] we must have 
AG{x,x') = = —-^G{x,x'), and this must mean 
that G{x, x') is a straight line for x < x' and x > x' . 
To satisfy, the boundary conditions, this straight line 
must pass through zero at and L, and hence G{x, x') 
must look like ax for x < x' and l3{x — L) for x > x' 
for some constants a and /3. 

G{x,x') had better be continuous at x = x' , or 
otherwise we would get a delta function from the 
first derivative — hence a = (3{x' — Vj/x' . The first 
derivative ^G{x,x') then gives a for x < x' and /3 
for X > .t'. What about the next derivative? Since 
^G{x,x') is discontinuous, it doesn't have an ordi- 
nary second derivative at x = x', but as a distribu- 
tion it is no problem: -^G{x,x') is zero everywhere 
(the derivative of a constant) plus a delta function 
5{x — x') multiplied by /? — a, the size of the jump. 
Thus, -|^G(x,x') = AG ={a- I3)5{x - x'), and 
from above we must have a — (3 = 1 . Combined with 
the equation for a from continuity of G, we obtain 
/? = —x'/L and a = 1 — x' /L, exactly the same as 
our result from class (which we got by a more labo- 
rious method). 

Further reading 

• I. M. Gel'fand and G. E. Shilov, Generalized 
Functions, Volume I: Properties and Operations 
(New York: Academic Press, 1964). [Out of 
print, but still my favorite book on the subject.] 

• Robert S. Strichartz, A Guide to Distribu- 
tion Theory and Fourier Transforms (Singapore: 
World Scientific, 1994). 

• Jesper Lutzen, The Prehistory of the Theory of 
Distributions (New York: Springer, 1982). [A 
fascinating book describing the painful historical 
process that led up to distribution theory.] 
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Distributions in a nutshell 



• Delta functions axe okay. You can employ their 

informal description without guilt because there 
is a rigorous definition to fall back on in case of 
doubt. 

— An "integral" S{x — x')(j){x)dx just 
means 4'{x'). Integrals over finite domains 
Jj^ 5{x — x')(f){x) give (j>{x') if x' is in the 
interior of Vt and if x' is outside Q, but 
are undefined (or at least, more care is re- 
quired) if x' is on the boundary dQ. 

• When in doubt about how to compute f'{x), 
integrate by parts against a test function to 
see what / f'{x)<p{x) = — J f{x)(()'{x) does (the 
"weak" or "distributional" derivative) . 

— A derivative of a discontinuity at x' gives 
S{x—x') multiplied by the size of the discon- 
tinuity [the difference f{x'^) — f{x'~)], plus 
the ordinary derivative everywhere else. 

— This also applies to differentiating func- 
tions like 1/y/x that have finite integrals 
but whose classical derivatives have diver- 
gent integrals — applying the weak deriva- 
tive instead produces a well-defined distri- 
bution. [For example, this procedure fa- 
mously yields V^^ = —4:nS{x) in 3d.] 

• All that matters in the distribution (weak) sense 

is the integral of a function times a smooth, lo- 
calized test function (/'(x). Anything that doesn't 
change such integrals J f{x)(j){x), like finite val- 
ues of /(.t) at isolated points, doesn't matter. 
(That is, whenever we use "=" for functions we 
are almost always talking about weak equality.) 

• Interchanging limits and derivatives is okay in 

the distribution sense. Differentiating Fourier 
series (and other expansions in infinite bases) 
term-by-term is okay. 

• In practice, we only ever need to solve PDEs in 
the distribution sense (a "weak" solution): inte- 
grating the left- and right-hand sides against any 
test functions must give the same number, with 
all derivatives taken in the weak sense. 
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